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DESCRIPTION 

RECORDING APPARATUS , RECORDING METHOD, REPRODUCING 



Technical Field 

The present invention relates to a recording 
apparatus and a recording method for recording a 
digital video signal to a tape shaped record medium and 
also to a reproducing apparatus and a reproducing 
method for reproducing a digital video signal from a 
tape shaped record medium. 
Background Art 



Recorder), a data recording and reproducing apparatus 
that records a digital video signal and a digital audio 
signal to a record medium and reproduces them therefrom 
is known. Since the data capacity of a digital video 
signal is huge, conventionally, it is compression- 
encoded corresponding to a predetermined method and 
then the encoded data is recorded to a record medium. 
In recent years, MPEG2 (Moving Picture Experts Group 
phase 2) is known as a compress ion -encoding standard. 



the above-mentioned MPEG2, the data compression ratio 
is improved using variable length code. Thus, 
depending on the complexity of a picture that is 
compressed, the amount of compressed code per screen 
(for example, per frame or per field) fluctuates. 



APPARATUS, AND REPRODUCING METHOD 



As exemplified with a digital VTR (Video Tape 



In picture compression technologies such as 



On the other hand, in a recording apparatus 
that records a video signal to a record medium such as 
a magnetic tape or a disc record medium, particularly, 
in a VTR, a predetermined unit such as one frame or one 
field is used as a unit of a fixed length. In other 
words , the amount of code per frame or field is limited 
to a predetermined value or less and recorded to a 
fixed capacity area of a storage medium. 

The reason why the fixed length format is 
used for a VTR is in that since each record area on a 
magnetic tape as a record medium is composed of one 
frame, record data for one frame should be just placed 
in each record area. In addition, since the record 
medium is used corresponding to record time, the total 
amount of record data on the record medium and the 
remaining amount thereof can be accurately obtained. 
As another advantage, a program start position 
detecting process can be easily performed in a high 
speed searching operation . In addition, from a view 
point of controlling of a record medium, if the record 
medium is a magnetic tape, when data is recorded in the 
fixed length format, since the magnetic tape that is 
dynamically driven can be traveled at a constant speed, 
the magnetic tape can be stably controlled. Likewise, 
these advantages can apply to disc shaped record 
mediums . 

The variable length code encoding format and 



the fixed length format have such contrary 
characteristics. In recent years, a recording 
apparatus that inputs a video signal as a non- 
compressed base band signal, compression-encodes the 
signal with variable length code corresponding to MPEG2 
or JPEG (Joint Photographic Experts Group), and records 
the encoded signal to a record medium is known. In 
addition, a recording and reproducing apparatus that 
directly inputs and outputs a stream that has been 
compression-encoded with variable length code and 
records and reproduces the stream has been also 
proposed. In the following description, it is assumed 
that the compression encoding format for a digital 
video signal is MPEG2 . 

Next, the structure of an MPEG2 data stream 
will be described in brief. MPEG2 is a combination of 
a motion compensation predictive encoding and a 
compression encoding using DCT. MPEG2 data is 
hierarchically structured. The MPEG 2 data is composed 
of a block layer as the lowest layer, a macro block 
layer, a slice layer, a picture layer, a GOP (Group Of 
Picture) layer, and a sequence layer as the highest 
layer . 

The block layer is composed of DCT blocks 
each of which is a data unit for DCT. The macro block 
layer is composed of a plurality of DCT blocks. The 
slice layer is composed of a header portion and at 



least one macro block. The picture layer is composed 
of a header portion and at least one slice- One 
picture corresponds to one screen. The GOP layer is 
composed of a header portion, an I picture (Intra-coded 
5 picture), a P picture (Predictive-coded picture), and a 

B picture (Bidirectionally predictive -coded picture). 

The I picture uses information of only a 
picture that is encoded. Thus, the I picture can be 
decoded as it is. The P picture uses an I picture or a 
JLO P picture that has been decoded before the current P 

2 picture is decoded. The difference between the current 

~j P picture and the motion compensated predictive picture 

is encoded or the current P picture is encoded without 
the difference. One of them is selected for each macro 
15 block depending on which is more effective. The B 

picture uses (1) an I picture or a P picture that has 
been decoded before the current B picture is decoded, 
(2) an I picture or a P picture that has been decoded 
before the current B picture is decoded, or (3) an 
20_ interpolated picture of (1) and (2). The difference 

between the current B picture and each of the three 
types of the motion compensated predictive pictures is 
encoded or the current B picture is encoded without the 
difference. One of them is selected for each macro 
25 block depending on which is the most effective. 

Thus, as types of macro blocks, there are an 
intra- frame encoded macro block, a forward inter- frame 



predictive macro block of which a future macro block is 
predicted with a past macro block, a backward inter- 
frame predictive macro block of which a past macro 
block is predicted with a future macro block, and a 
bidirectional macro block that is predicted in both the 
forward and backward directions. All macro blocks in 
an I picture are all intra-frame macro blocks. A P 
picture contains an intra-frame macro block and a 
forward inter-frame predictive macro block. A B 
picture contains all the four types of macro blocks. 

A macro block is a set of a plurality of DCT 
blocks and formed by dividing one screen (picture) into 
a lattice of 16 pixels x 16 lines. A slice is formed 
by connecting macro blocks for example in the 
horizontal direction. The number of macro blocks per 
one screen depends on the size thereof. 

In the MPEG format, one slice is one variable 
length code sequence. The variable length code 
sequence is a sequence of which the boundary of data 
cannot be detected unless variable length code is 
correctly decoded. When an MPEG stream is decoded, the 
header portion of a slice is detected so as to obtain 
the start point and the end point of variable length 
code . 

In MPEG, conventionally, one slice is 
composed of one stripe (16 lines). The variable length 
encoding starts at the left edge of the screen and ends 



at the right edge of the screen. Thus, when a VTR has 
recorded an MPEG elementary stream, if it is reproduced 
at high speed, the VTR mainly reproduces the left edge 
of the screen. Thus, the screen cannot be equally 
5 updated. In addition, since the position on the tape 

cannot be predicted, if a tape pattern is traced at 
predetermined intervals, the screen cannot be equally 
updated. Moreover, if at least one error takes place, 
it adversely affects until the right edge of the screen . 

J.0 Thus, until the next slice header is detected, the 

error continues. Thus, when one slice is preferably 

;f composed of one macro block, such an inconvenience can 

HF be solved. 

"1 On the other hand, a video signal is recorded 

015 on a magnetic tape in helical track format of which 

y tracks are diagonally formed with a rotating head. On 

rj one track, sync blocks, each of which is the minimum 

record unit, are grouped for each data type as sectors. 
In addition, data for one frame is recorded as a 
20„ plurality of tracks. 

In MPEG, to allow data to be accessed at 
random, a GOP (Group Of Picture) structure as a group 
of a plurality of pictures is defined. The provisions 
with respect to GOP in MPEG state that firstly the 
25 first picture of a GOP as a stream is an I picture and 

that secondly the last picture of a GOP in the order of 
original pictures is an I picture or a P picture. In 
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addition, as a GOP, a structure of which a prediction 
using the last I picture or P picture of an earlier GOP 
is required is permitted. A GOP that can be decoded 
without need to use a picture of an earlier GOP is 
referred to as closed GOP. 

In a digital VTR, an editing process is 
normally performed. The editing process is preferably 
performed in as small data unit as possible. When an 
MPEG2 stream has been recorded, one GOP may be used as 
an edit unit. In the structure of a closed GOP of 
which a GOP can be decoded without need to use an 
earlier GOP or a later GOP, an editing process can be 
performed for each GOP. However, when a GOP is 
composed of for example 15 frames, the editing unit is 
too large. Thus, it is preferred to perform an editing 
process in the accuracy of frame (picture). 

However, when an MPEG stream contains a 
predictive picture that requires an earlier picture or 
both an earlier picture and a later picture for 
decoding the predictive picture, it becomes impossible 
to perform the editing process for each frame. Thus, 
preferably, all pictures are encoded with intra -frame 
code and one GOP is composed of one intra-picture. 
Such a stream satisfies the encoding syntax of MPEG2. 

In addition, at the beginning of each of the 
sequence layer, the GOP layer, the picture layer, the 
slice layer, and the macro block layer, identification 



code composed of a predetermined bit pattern is placed. 
The identification code is followed by a header portion 
that contains encoding parameters of each layer. An 
MPEG decoder that performs an MPEG2- decoding process 
extracts identification code by a pattern-matching 
operation, determines the hierarchical level, and 
decodes the MPEG stream corresponding to the parameter 
information contained in the header portion. The 
header of each layer lower than the picture layer is 
information necessary for each frame. Thus, the header 
should be added to each frame. In contrast, the header 
of the sequence layer should be added to each sequence 
or each GOP. In other words, it is not necessary to 
add the header of the sequence layer to each frame. 

Information contained in the header of the 
sequence layer is number of pixels, bit rate, profile, 
level, color difference format, progressive sequence, 
and so forth. These information is normally the same 
in all the sequence when it is assumed that one video 
tape is one sequence. According to the encoding syntax 
of MPEG, the header of the sequence layer can be added 
at the beginning of the video tape. In addition, 
according to the encoding syntax of MPEG, a quantizing 
matrix may be present in the header of other than the 
sequence layer (namely, the header of the sequence 
layer or the header of the picture layer) . According 
to the encoding syntax of MPEG, the quantizing matrix 
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can be added or omitted* 

As information contained in the header of the 
picture layer, the accuracy of DC (Direct Current) 
coefficient of an intra macro block is set; the frame 
structure, field structure, and display field are 
designated; the quantizing scale is selected; the VLC 
type is selected; the zigzag/alternate scanning is 
selected; and the chroma format and so forth are 
designated. To allow an input picture to be 
effectively encoded corresponding to the characteristic 
thereof, the header of the sequence layer and the 
header of the picture layer can be changed for each 
frame. 

In a digital VTR, an MPEG stream is recorded 
on a magnetic tape with a rotating head. Diagonal 
tracks are successively formed on the magnetic tape. 
In the normal reproducing operation whose tape speed is 
the same as the recording operation, since all recorded 
data can be reproduced, even if the header information 
is changed for each frame, no problem takes place. 
However, in the high speed reproducing operation whose 
tape speed is higher than the recording operation (for 
example, twice or higher), since data of the tape is 
f ragmentarily reproduced, if information of the header 
is changed for each frame, a problem takes place. 

Fig. 26 conceptually shows reproduced data in 
the high speed reproducing operation. Data of each of 
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frame 1, frame 2, frame 3, . .. and so forth is composed 
of a header and picture data. There are a sequence 
header, a GOP header, and a picture header. The 
picture header is always added to each frame. In the 
high speed reproducing operation, data shaded in the 
drawing is f ragmentarily reproduced from each frame. 
The obtained data reproduces a picture of one frame. 

As was described above, to allow an input 
picture to be effectively encoded corresponding to the 
characteristic thereof, the header of the sequence 
layer and the header of the picture layer can be 
changed for each frame. Thus, if the header of frame 1 
is different from the header of picture data of another 
frame, frame 1 cannot be correctly decoded. 

Therefore, an object of the present invention 
is to provide a recording apparatus, a recording method, 
a reproducing apparatus , and a reproducing method that 
allow compression-encoded data f ragmentarily reproduced 
in the high speed reproducing operation to be decoded 
to a picture. 

Disclosure of the Invention 

Claim 1 of the present invention is a 
recording apparatus for recording a digital video 
signal to a tape shaped record medium, comprising a 
means for recording a stream in which a compression 
encoding has been performed and a header has been added 
to the tape shaped record medium, wherein information 
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of the header added to each frame is the same in all 
frames . 

Claim 5 of the present invention is a 
recording method for recording a digital video signal 
to a tape shaped record medium, comprising the step of 
recording a stream in which a compression encoding has 
been performed and a header has been added to the tape 
shaped record medium, wherein information of the header 
added to each frame is the same in all frames. 

According to claims 1 and 5 of the present 
invention, since information of a header is the same in 
all frames, even if data of a plurality of frames is 
f ragmentarily reproduced in the high speed reproducing 
operation, the reproduced data can be almost securely 
decoded. 

Claim 6 of the present invention is a 
recording apparatus for recording a digital video 
signal to a tape shaped record medium, comprising a 
means for recording a stream in which compression 
encoding has been performed and a header has been added 
to the tape shaped record medium, wherein a system area 
that is almost securely reproduced in a high speed 
reproducing operation of which the tape shaped record 
medium is traveled at higher speed than a recording 
operation is formed as an area separated from a record 
area for the stream, and wherein at least part of the 
header is recorded to the system area. 



Claim 10 of the present invention is a 
recording method for recording a digital video signal 
to a tape shaped record medium, comprising the step of 
recording a stream in which compression encoding has 
been performed and a header has been added to the tape 
shaped record medium, wherein a system area that is 
almost securely reproduced in a high speed reproducing 
operation of which the tape shaped record medium is 
traveled at higher speed than a recording operation is 
formed as an area separated from a record area for the 
stream, and wherein at least part of the header is 
recorded to the system area. 

Claim 11 of the present invention is a 
reproducing apparatus for reproducing a tape shaped 
record medium on which a stream has been recorded, in 
the stream, compression encoding having been performed 
and a header having been added, at least part of the 
header having been recorded in a system area that is 
almost securely reproduced in a high speed reproducing 
operation of which the tape shaped record medium is 
traveled at higher speed than a recording operation and 
that is formed as an area separated from a record area 
for the stream, wherein in the high speed reproducing 
operation, the reproduced stream is decoded using 
information contained in the header reproduced from the 
system area. 

Claim 15 of the present invention is a 



reproducing method for reproducing a tape shaped record 
medium on which a stream has been recorded, in the 
stream, compression encoding having been performed and 
a header having been added, at least part of the header 
having been recorded in a system area that is almost 
securely reproduced in a high speed reproducing 
operation of which the tape shaped record medium is 
traveled at higher speed than a recording operation and 
that is formed as an area separated from a record area 
for the stream, wherein in the high speed reproducing 
operation, the reproduced stream is decoded using 
information contained in the header reproduced from the 
system area. 

Claim 16 of the present invention is a 
recording apparatus for recording a digital video 
signal to a tape shaped record medium, comprising a 
means for recording a stream in which a compression 
encoding has been performed and a header has been added 
to the tape shaped record medium, wherein information 
of the header added to each frame is the same in all 
frames, wherein a system area that is almost securely 
reproduced in a high speed reproducing operation of 
which the tape shaped record medium is traveled at 
higher speed than a recording operation is formed as an 
area separated from a record area for the stream, and 
wherein at least part of the header is recorded to the 
system area. 



According to claims 6, 10, 11, and 15 of the 
present invention, since at least part of information 
of a header is recorded to a system area that can be 
almost securely reproduced in the high speed 
reproducing operation, even if the header portion 
cannot be reproduced, the reproduced data can be 
decoded. According to claim 16 of the present 
invention, since information of a header is the same in 
all frames and at least part of information of the 
header is recorded to the system area, the reproduced 
data can be more securely decoded in the high speed 
reproducing operation. 
Brief Description of Drawings 

Fig. 1 is a schematic diagram showing the 
hierarchical structure of a conventional MPEG2 stream. 

Fig. 2 is a schematic diagram showing the 
contents of data placed in an MPEG2 stream and bit 
assignments thereof . 

Fig. 3 is a schematic diagram showing the 
contents of data placed in an MPEG2 stream and bit 
assignments thereof . 

Fig. 4 is a schematic diagram showing the 
contents of data placed in an MPEG2 stream and bit 
assignments thereof . 

Fig. 5 is a schematic diagram showing the 
contents of data placed in an MPEG2 stream and bit 
assignments thereof . 
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Fig. 6 is a schematic diagram showing the 
contents of data placed in an MPEG2 stream and bit 
assignments thereof . 

Fig. 7 is a schematic diagram showing the 
5 contents of data placed in an MPEG2 stream and bit 

assignments thereof . 

Fig. 8 is a schematic diagram showing the 
contents of data placed in an MPEG2 stream and bit 
assignments thereof . 
10 Fig. 9 is a schematic diagram showing the 

contents of data placed in an MPEG2 stream and bit 
assignments thereof . 

Fig. 10 is a schematic diagram showing the 
contents of data placed in an MPEG2 stream and bit 
15 assignments thereof. 

Fig. 11 is a schematic diagram showing the 
contents of data placed in an MPEG2 stream and bit 
assignments thereof. 

Fig. 12 is a schematic diagram showing the 
20 contents of data placed in an MPEG2 stream and bit 

assignments thereof . 

Fig. 13 is a schematic diagram for explaining 
an arrangement of bytes of data. 

Fig. 14 is a schematic diagram showing the 
25 data structure of an MPEG stream according to the 

embodiment of the present invention. 

Fig. 15 is a block diagram showing an example 
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of the structure of a recording and reproducing 
apparatus according to the embodiment of the present 
invention . 

Fig. 16 is a schematic diagram showing an 
example of a format of tracks formed on a magnetic tape 
and the structure of data recorded in a system area. 

Fig. 17 is a schematic diagram for explaining 
an output method and a variable length encoding of a 
video encoder. 

Fig. 18 is a schematic diagram for explaining 
the rearrangement of an output sequence of the video 
encoder . 

Fig. 19 is a schematic diagram for explaining 
a process for packing the sequence rearranged data to 
sync blocks . 

Fig. 20 is a schematic diagram showing in 
reality the packing process. 

Fig. 21 is a block diagram showing a more 
practical structure of an ECC encoder. 

Fig. 22 is a schematic diagram showing an 
example of an address structure of a main memory. 

Fig. 23 is a block diagram showing an example 
of the structure for a recording process for the system 
area according to the embodiment of the present 
invention . 

Fig. 24 is a block diagram showing an example 
of the structure for a process for data reproduced from 



the system area according to the embodiment of the 
present invention . 

Fig. 25 is a flow chart for explaining the 
reproducing process shown in Fig. 24. 

Fig. 26 is a schematic diagram for explaining 
a problem solved by the present invention. 
Best Modes for Carrying out the Invention 

Next, an embodiment of the present invention 
will be described. The embodiment is applied to a 
digital VTR. The embodiment is suitable for the 
environment of a broadcasting station. 

According to the embodiment, for example 
MPEG2 is used as a compression formation. The MPEG2 is 
a combination of a motion compensation predictive 
encoding and a compression encoding using DCT . MPEG2 
data is hierarchically structured. As shown in Fig. 1, 
MPEG2 data is composed of a macro block layer (Fig. IE) 
as the lowest layer, a slice layer (Fig. ID), a picture 
layer (Fig. 1C), a GOP layer (Fig. IB), and a sequence 
layer (Fig. 1A) as the highest layer. 

As shown in Fig. IE, the macro block layer is 
composed of DCT blocks each of which is a data unit for 
DCT. The macro block layer is composed of a macro 
block header and a plurality of DCT blocks. As shown 
in Fig. ID, the slice layer is composed of a slice 
header portion and at least one macro block. As shown 
in Fig. 1C, the picture layer is composed of a picture 



header portion and at least one slice. One picture 
corresponds to one screen. As shown in Fig. IB, the 
GOP layer is composed of a GOP header portion, an I 
picture, a P picture, and a B picture. The I picture 
is a picture that has been intra- frame encoded. The P 
and B pictures are pictures that have been predictively 
encoded. 

According to the encoding syntax of MPEG, one 
GOP contains at least one I picture. However, one GOP 
may contain neither a P picture nor a B picture. As 
shown in Fig. 1A, the sequence layer, which is the 
highest layer, is composed of a sequence header portion 
and a plurality of GOPs. In the MPEG format, a slice 
is one variable length code sequence- The variable 
length code sequence is a sequence of which the 
boundary of data cannot be detected unless variable 
length code is correctly decoded. 

At the beginning of each of the sequence 
layer, the GOP layer, the picture layer, and the slice 
layer, a start code is placed. The start code is a 
predetermined bit pattern composed of bytes. The start 
code differs in each of the layers. Particularly, in 
the sequence layer, the start code is referred to as 
sequence header code. In each of the other layers, the 
start code is just referred to as start code. Each 
start code has a bit pattern of [00 00 01 xx] 
(hexadecimal notation). Thus, the bit pattern has four 
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set of two digits. In addition, [xx] represents that 
each layer has a unique bit pattern. 

In other words, each of the start codes and 
the sequence header code is composed of four bytes (= 
32 bits). Corresponding to the value of the fourth 
byte, the type of information that follows is 
identified. Since each of the start codes and the 
sequence header code is arranged in bytes, they can be 
acquired by matching a pattern of four bytes. 

The high order four bits of one byte preceded 
by the start code is an identifier the represents the 
content of an extension data area (that will be 
described later) . Corresponding to the value of the 
identifier, the content of the extension data can be 
identified. 

Each DCT block in the macro block layer or 
each DCT block in each macro block does not have an 
identification code having a predetermined bit pattern 
arranged in bytes . 

Next, the header portion of each layer will 
be described in detail. In the sequence layer shown in 
Fig. 1A, at the beginning, a header 2 is placed. The 
header 2 is followed by a sequence extension 3 and 
extension and user data 4. At the beginning of the 
sequence header 2, a sequence header code 1 is placed. 
Likewise, at the beginning of each of the sequence 
extension 3 and the user data 4, a predetermined start 



code (not shown) is placed. The area from the sequence 
header 2 to the extension and user data 4 is a header 
portion of the sequence layer. 

Fig. 2 shows contents and assigned bits of 
the sequence header 2. As shown in Fig. 2, the 
sequence header 2 contains sequence header code 1, 
encoded picture size (composed of the number of pixels 
in horizontal direction and the number of lines in 
vertical direction )~ aspect ratio, frame rate, bit rate, 
VBV (Video Buffering Verifier) buffer size, and 
quantizing matrix that are designated for each sequence 
with the designated numbers of bits. 

As shown in Fig. 3, the sequence extension 3 
preceded by the extension start code preceded by the 
sequence header designates additional data used in 
MPEG2. The addition data is for example profile, level, 
chroma (color difference) format, and progressive 
sequence. As shown in Fig. 4, the extension and user 
data 4 contains sequence indication ( ) and sequence 
scalable extension ( ) . The sequence indication ( ) 
contains information of RGB conversion characteristic 
of an original signal and display screen size. The 
sequence scalable extension ( ) designates a scalability 
mode and the layer of scalability. 

The header portion of the sequence layer is 
followed by GOPs . As shown in Fig. IB, at the 
beginning of each GOP, GOP header 6 and user data 7 are 



placed. The GOP header 6 and the user data 7 are a 
header portion of each GOP. As shown in Fig. 5, the 
GOP header 6 contains start code 5, time code, and 
flags representing independency and validity of the GOP 
with the designated numbers of bits. As shown in Fig. 
6, the user data 7 contains extension data and user 
data. At the beginning of each of the extension data 
and the user data, predetermined start code (not shown) 
is placed. 

The header portion of the GOP layer is 
followed by pictures. As shown in Fig. 1C, at the 
beginning of each picture, picture header 9, picture 
encoding extension 10, and extension and user data 11 
are placed. At the beginning of the picture header 9, 
picture start code 8 is placed. At the beginning of 
each of the picture encoding extension 10 and the 
extension and user data 11, a predetermined start code 
is placed. The area from the picture header 9 to the 
user data 11 is a header portion of each picture. 

As shown in Fig. 7, the picture header 9 
contains picture start code 8. In addition, in the 
picture header 9 , encoding condition for a screen is 
designated. As shown in Fig. 8, in the picture 
encoding extension 10, the range of a moving vector in 
the forward, backward, and horizontal/vertical 
directions is designated. In addition, the picture 
structure is designated. In the picture encoding 



extension 10, the accuracy of DC coefficients of an 
intra-macro block is designated; the VLC type is 
selected; the linear /non- linear quantizing scale is 
selected; and the scanning method in DCT is selected. 

As shown in Fig, 9, in the extension and user 
data 11, quantizing matrix, spatial scalable parameter, 
and so forth are designated. According to the encoding 
syntax of MPEG, they can be designated for each picture 
Thus, a picture can be encoded corresponding to 
characteristics of each screen. Moreover, in the 
extension and user data 11, the picture display area 
can be designated. Furthermore, in the extension and 
user data 11, copyright information can be designated. 

The header portion of the picture layer is 
followed by slices. As shown in Fig. ID, at the 
beginning of each slice, slice header 13 is placed. At 
the beginning of the slice header 13, slice start code 
12 is placed. As shown in Fig. 10, the slice start 
code 12 includes position information in the vertical 
direction of the current slice. In addition, the slice 
header 13 contains extended slice vertical position 
information, quantizing scale information, and so forth 

The header portion of the slice layer is 
followed by macro blocks (see Fig. IE). Each macro 
block contains a macro block header 14 and a plurality 
of DCT blocks. As was described above, the macro block 
header does not contain a start code. As shown in Fig. 



11, the macro block header 14 contains relative 
position information of the current macro block. In 
addition, in the macro block header 14, motion 
compensation mode and detail information about DCT 
encoding are designated. 

The macro block header 14 is followed by DCT 
blocks. As shown in Fig. 12, each DCT block contains 
variable -length code encoded DCT coefficients and data 
about DCT coefficients. 

In Fig. 1, solid line partitions of each 
layer represent that data is arranged in bytes, whereas 
dotted line partitions thereof represent that data is 
not arranged in bytes . In other words , as shown in Fig . 
13A, in higher layers up to the picture layer, the 
boundary of code is delimited in bytes. On the other 
hand, in the slice layer, only the slice start code 12 
is delimited in bytes. As shown in Fig. 13B, each 
macro block can be delimited in bits. Likewise, in the 
macro block layer, each DCT block can be delimited in 
bits. 

The data structure described with reference 
to Figs. 1 to 13 is a conventional MPEG data structure. 
According to the embodiment, to allow encoded data to 
be edited for each frame, all frames are intra- encoded. 
In addition, one GOP is composed of one I picture. 
Moreover, one slice is composed of one macro block. In 
such an MPEG bit stream, each item (flag) of the above- 



described header portion may be a fixed value. 

According to the encoding syntax of MPEG, the 
values of the header of the sequence layer and the 
header of the picture layer can be designated for each 
picture. However, according to the embodiment, to 
securely decode a reproduced picture in the high speed 
reproducing operation, the values of the header of the 
sequence layer and the header of the picture layer are 
the same in each frame. In reality, the MPEG encoder 
performs such an encoding process. 

Fig. 14 shows the headers of an MPEG stream 
according to the embodiment in reality. As is clear 
from Fig. 1, at the beginnings of the sequence layer, 
the GOP layer, the picture layer, the slice layer, and 
the macro block layer, the headers are placed. Fig. 14 
shows an example of a data arrangement starting with 
the sequence header portion. 

At the beginning, sequence header 2 having 
the length of 12 bytes is placed. The sequence header 
2 is followed by sequence extension 3 having the length 
of 10 bytes. The sequence extension 3 is followed by 
extension and user data 4 . At the beginning of the 
extension and user data 4, user data start code having 
the length of four bytes is placed. The user data 
start code is followed by user data area. The user 
data area contains information corresponding to SMPTE 
standard. 



The header portion of the sequence layer is 
followed by a header portion of the GPO layer. The 
header portion contains GPO header 6 having the length 
of eight bytes. The GOP header 6 is followed by 
extension and user data 7. At the beginning of the 
extension and user data 7, user data start code having 
the length of four bytes is placed. The user data 
start code is followed by user data area. The user 
data area contains information necessary for having 
compatibility with another conventional video format. 

The header portion of the GOP layer is 
followed by header portion of the picture layer. The 
picture portion contains picture header 9 having the 
length of nine bytes. The picture header 9 is followed 
by picture encoding extension 10 having the length of 
nine bytes. The picture encoding extension 10 is 
followed by extension and user data 11. The first 133 
bytes of the extension and user data 11 is extension 
and user data. The extension and user data is followed 
by user data start code 15 having the length of four 
bytes. The user data start code 15 is followed by 
information necessary for having compatibility with 
another conventional video format. The information is 
followed by user data start code 16. The user data 
start code 16 is followed by data corresponding to 
SMPTE standard. The header portion of the picture 
layer is followed by slices . 



Next, a macro block will be further described. 
Each macro block contained in the slice layer is a set 
of a plurality of DCT blocks. An encoded sequence of 
DCT blocks is composed of sets of runs and levels. A 
run represents the number of 0 f s as a quantized DCT 
coefficient. A level is immediately preceded by a run. 
A level represents a non-zero value as a quantized DCT 
coefficient. Neither each macro block nor each DCT 
block contained in each macro block does not contain 
identification code arranged in bytes. 

A macro block is formed by dividing one 
screen (picture) into a lattice of 16 pixels x 16 lines. 
A slice is formed by connecting macro blocks for 
example in the horizontal direction. The last macro 
block of one slice is continued to the first macro 
block of the next slice. Macro blocks between two 
slices are prohibited from being overlapped. The 
number of macro blocks per one screen depends on the 
size thereof . 

The number of macro blocks in the vertical 
direction of a screen is referred to as mb„height , 
whereas the number of macro blocks in the horizontal 
direction of a screen is referred to as mb_width. The 
coordinates of a macro block are defined as mb_height 
and mb_column. mb_height is the vertical position 
number of the current macro block counted from the 
upper edge of the screen, the upper edge being 0. 



mb__column is the horizontal position number of the 
current macro block counted from the left edge of the 
screen, the left edge being 0. The position of a macro 
block on the screen is represented with one variable as 
macroblock__address = mb_row x mb_width + mb_column. 

The order of slices and macro blocks of a 
stream is defined with macroblock_address . In other 
words, a stream is transmitted in the downward 
direction and leftward direction of the screen. 

In the MPEG, normally, one slice is composed 
of one stripe (16 lines). The variable length encoding 
starts at the left edge of the screen and ends at the 
right edge of the screen. Thus, when a VTR has 
recorded an MPEG elementary stream, if it is reproduced 
at high speed, the VTR mainly reproduces the left edge 
of the screen. Thus, the screen cannot be equally 
updated. In addition, since the position on the tape 
cannot be predicted, if a tape pattern is traced at 
predetermined intervals, the screen cannot be equally 
updated. Moreover, if at least one error takes place, 
it adversely affects until the right edge of the screen. 
Thus, until the next slice header is detected, the 
error continues. Consequently, one slice is composed 
of one macro block. 

Fig. 15 shows an example of the structure of 
the record side of a recording and reproducing 
apparatus according to the embodiment of the present 



invention. When the recording operation is performed, 
a digital signal is input from a terminal 100. The 
digital signal is supplied to an SDI (Serial Data 
Interface) receiving portion 101. The SDI is an 
interface defined by SMPTE so that a component video 
signal, a digital audio signal, and additional data can 
be transmitted. The SDI receiving portion 101 extracts 
a digital video signal and a digital audio signal from 
the input digital signal. The digital video signal is 
supplied to a MPEG encoder 102. The digital audio 
signal is supplied to an ECC encoder 109 through a 
delay 103. The delay 103 absorbs the time difference 
between the digital audio signal and the digital video 
signal. 

In addition, the SDI receiving portion 101 
extracts a synchronous signal from the input digital 
signal. The extracted synchronous signal is supplied 
to a timing generator 104. Alternatively, an external 
synchronous signal may be input from a terminal 105 to 
the timing generator 104. The timing generator 104 
generates timing pulses corresponding to a designated 
signal of the input synchronous signal and a 
synchronous signal supplied from a SDTI receiving 
portion 108 (that will be described later). The 
generated timing pulses are supplied to each portion of 
the recording and reproducing apparatus. 

The MPEG encoder 102 performs a DCT (Discrete 
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Cosine Transform) process for the input video signal so 
as to transform the input video signal into coefficient 
data and then encode the coefficient data with 
variable -length code* The variable- length code encoded 
data (VLC) data is an elementary stream (ES) 
corresponding to the MPEG2. The output is supplied to 
one of input terminals of a record side multif ormat 
converter (referred to as MFC) 106. 

On the other hand, data in SDTI (Serial-Data 
Transport Interface) format is input from an input 
terminal 107. The signal is synchronously detected by 
an SDTI receiving portion 108. Thereafter, the signal 
is temporarily stored in a buffer. Thereafter, the 
elementary stream is extracted from the buffer. The 
extracted elementary stream is supplied to another 
input terminal of the record side MFC 106. The 
synchronous signal that has been synchronously detected 
is supplied to the above -de scribed timing generator 104. 

According to the embodiment , to transmit an 
MPEG ES (MPEG elementary stream) , for example SDTI 
(Serial Data Transport Interface) - CP (Content 
Package) is used. The ES is 4:2:2 components. In 
addition, the ES is a stream composed of only I 
pictures. Moreover, the ES has the relation of 1 GOP = 
1 picture. In the SDTI - CP format, the MPEG ES is 
separated into access units and packed to packets 
corresponding to frames. In the SDTI - CP format, a 



sufficient transmission band (27 MHz or 36 MHz of clock 
rate or 270 Mbps or 360 Mbps of stream bit rate. Thus, 
in one frame period, the ES can be transmitted as a 
burst . 

In the area after SAV until EAV of one frame 
period, system data, video stream, audio stream, and 
AUX data are placed. Data is not equally placed in the 
entire frame period. Instead, in a predetermined 
period from the beginning, data is placed as a burst. 
At the boundary of a frame, an SDTI - CP stream (video 
and audio) can be switched in the form of a stream. In 
the SDTI - CP format, when contents use SMPTE time code 
corresponding to the clock, audio is synchronized with 
video. In addition, it is defined that SDTI -CP and SDI 
coexist . 

As in the case that a TS (Transport Stream) 
is transferred, the above -described interface 
corresponding to the SDTI - CP format does not need to 
cause an SDTI - CP steam to flow to a VBV (Video Buffer 
Verifier) buffer and TBs (Transport Buffers) of the 
encoder and the decoder. Thus, the delay of the stream 
can be decreased. In addition, since the SDTI - CP 
format allows a stream to be transferred at very high 
rate, the delay can be further decreased. Thus, in an 
environment of which there is a synchronization in the 
entire broadcasting station, the SDTI - CP format can 
be effectively used. 
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The SDTI receiving portion 108 further 
extracts a digital audio signal from the input SDTI - 
CP stream. The extracted digital audio signal is 
supplied to the ECC encoder 109. 

The record side MFC 106 contains a selector 
and a stream converter. The record side MFC 106 is 
disposed in for example one integrated circuit. Next, 
the process performed by the record side MFC 106 will 
be described. An MPEG ES supplied jErom the MPEG 
encoder 102 or an MPEG ES supplied from the SDTI 
receiving portion 108 is selected by the selector. The 
selected MPEG stream is processed by the record side 
MFC 106. 

The record side MFC 106 rearranges DCT 
coefficients of individual DCT blocks of one macro 
block arranged corresponding to the MPEG2 standard to 
DCT coefficients over all DCT blocks corresponding to 
frequency components. In addition, when one slice of 
an elementary stream is composed of one stripe, the 
record side MFC 106 converts the elementary stream so 
that one slice is composed of one macro block. 
Moreover, the record side MFC 106 limits the maximum 
length of the variable length data that takes place in 
one macro block to a predetermined length. This 
process is performed by designating 0 to high order DCT 
coefficients. Moreover, the record side MFC 106 
performs an interpolating process for the header of the 



sequence layer and the quantizing matrix for each 
picture of the MPEG bit stream. The converted 
elementary stream rearranged by the record side MFC 106 
is supplied to the ECC encoder 109. 

A main memory having a large storage capacity 
(not shown) is connected to the ECC encoder 109. The 
ECC encoder 109 comprises a packing and shuffling 
portion, an audio outer code encoder, a video outer 
code encoder, a video inner code encoder, an audio 
shuffling portion, a video shuffling portion, and so 
forth. The ECC encoder 109 comprises an ID adding 
circuit and a synchronous signal adding circuit. The 
ID adding circuit adds an ID to each sync block. The 
ECC encoder 109 is composed of for example one 
integrated circuit. 

According to the embodiment, error correction 
code used for video data and audio data is product code 
of which the video data or audio data is encoded with 
outer code in the vertical direction of a two 
dimensional array and the video data or audio data is 
encoded with inner code in the horizontal direction of 
the two dimensional array. Thus, with the product code, 
data symbols are dually encoded. As the outer code and 
inner code, Reed- Solomon code can be used. 

Next, the process performed by the ECC 
encoder 109 will be described. Since video data of a 
converted elementary stream has been encoded with 



variable length code, the data length of each macro 
block varies. The packing and shuffling portion packs 
each macro block in a fixed length . When a macro block 
cannot be packed in the fixed length, the overflow 
portion is packed to other areas that have spaces 
against the fixed length. 

In addition, system data containing 
information about picture format, version of shuffling 
pattern, and so forth is supplied from a system 
controller 121 (that will be described later). The 
system data is input from an input terminal (not shown) 
The system data is supplied to the packing and 
shuffling portion. The packing and shuffling portion 
performs a record process for the system data as with 
picture data. The system data is recorded as video AUX 
In addition, the packing and shuffling portion 
rearranges macro blocks of one frame that are generated 
in the scanning order and performs a shuffling process 
for dispersing the record positions of the macro blocks 
on the tape. Since the macro blocks are shuffled, even 
if data is partly reproduced when it is reproduced at 
high speed, the update ratio of the picture can be 
improved . 

The video data and system data supplied from 
the packing and shuffling portion (unless otherwise 
specified, even if video data contains system data, the 
video data is simply referred to as video data) is 
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supplied to the video outer code encoder that encodes 
video data with outer code. The video outer code 
encoder adds an outer code parity to the video data. 
An output of the outer code encoder is supplied to the 
video shuffling portion. The video shuffling portion 
performs a shuffling process for the output of the 
outer code encoder so as to change the order of sync 
blocks over a plurality of ECC blocks . Since sync 
blocks are shuffled, an error can be prevented from 
concentrating on a particular ECC block. The shuffling 
process performed by the shuffling portion may be 
referring to interleave. An output of the video 
shuffling portion is written to the main memory. 

On the other hand, as was described above, a 
digital audio signal that is output from the SDTI 
receiving portion 108 or the delay 103 is supplied to 
the ECC encoder 109. According to the embodiment, non- 
compressed digital audio signal is handled. 
Alternatively, the digital audio signal may be input 
through an audio interface. In addition, audio AUX is 
supplied from an input terminal (not shown) . The audio 
AUX is auxiliary data that contains information about 
audio data such as sampling frequency. The audio AUX 
is added to audio data and treated in the same manner 
as audio data. 

Audio data to which the audio AUX has been 
added (unless otherwise specified, referred to as audio 



data) is supplied to the audio outer code encoder that 
encodes audio data with outer code. An output of the 
audio outer code encoder is supplied to an audio 
shuffling portion. The audio shuffling portion 
shuffles the audio data. The audio shuffling portion 
shuffles the audio data for each sync block and for 
each channel. 

An output of the audio shuffling portion is 
written- to a main memory. As was described above, the 
output of the video shuffling portion is also written 
to the main memory. The main memory mixes the audio 
data and the video data as data of one channel. 

Data is read from the main memory. An ID 
that represents information of a sync block number is 
added to the data. The resultant data is supplied to 
the inner code encoder. The inner code encoder encodes 
the supplied data with inner code. A synchronous 
signal is added to an output of the inner code encoder 
for each sync block. As a result, record data as 
successive sync blocks is formed. 

Record data that is output from the ECC 
encoder 109 is supplied to an equalizer 110 that 
includes a recording amplifier and so forth. The 
equalizer 110 converts the supplied data into a record 
RF signal. The record RF signal is supplied to a 
rotating drum 111 on which a rotating head is disposed 
at a predetermined position and then recorded on the 



magnetic tape 112. In reality, a plurality of magnetic 
heads are disposed in such a manner that azimuths of 
heads that form adjacent tracks are different. 

When necessary, a scrambling process may be 
performed for the record data. When digital data is 
recorded, it may be digitally modulated. Moreover, 
partial response class 4 and Viterbi encoding may be 
used. The equalizer 110 contains both the structure 
for the record side and the structure for the 
reproduction side . 

Fig. 16 shows a track format in the case that 
an interlaced video signal whose frame frequency is 
29.97 Hz and whose size is 720 pixels (the number of 
effective horizontal pixels) x 480 lines (the number of 
effective lines) and PCM audio signals of four channels 
are recorded on a magnetic tape with a rotating head. 
In the example, video data and audio data for one frame 
are recorded with four tracks. Two tracks having 
different azimuths are paired. On each track, a record 
area for audio data (namely, audio sector) is formed at 
a nearly center portion. A video record area (video 
sector) is formed on both sides of the audio sector. 

In the example, audio data of four channels 
can be handled. Al to A4 represent channels 1 to 4 of 
audio data, respectively. Audio data is recorded in 
such a manner that the arrangement of audio data is 
changed in each set of two tracks having different 



azimuths. In the example, video data for four error 
correction blocks per track is interleaved. The 
interleaved data is divided into an upper side sector 
and a lower side sector and recorded. 

The lower side video sector has a system area 
(SYS) at a predetermined position. The system area is 
alternately formed on the beginning side and the end 
side of the lower side video sector of each track. 

In Fig. 16, SAT is an area in which a servo 
lock signal is recorded. Between each area, a gap 
having a predetermined size is formed. 

In Fig. 16, data for each frame is recorded 
with four tracks. Depending on the format of data that 
is recorded and reproduced, data for each frame may be 
recorded with eight tracks per frame, six tracks per 
frame, and so forth. 

As shown in Fig. 16B, data recorded on the 
tape is composed of a plurality of blocks formed at 
equal intervals. These blocks are referred to as sync 
blocks. Fig. 16C shows an outline of the structure of 
a sync block. A sync block is composed of sync pattern, 
ID, DID, data packet, and error correction inner code 
parity. The SYNC pattern is used to detect 
synchronization. The ID is used to identify the 
current sync block. The DID is used to represent the 
content of data that follows. Data is treated as 
packets corresponding to sync blocks . In other words , 



the minimum unit of data that is recorded or reproduced 
is one sync block. Many sync blocks (see Fig. 16B) 
compose for example a video sector. 

As shown in Fig. 16A, the system area is 
separated from the video data. When the rotating head 
reproduces a plurality of tracks at a time in the high 
speed reproducing operation, the system area can be 
almost securely reproduced. The high speed reproducing 
operation is an operation whose tape speed is for 
example twice or more higher than the tape speed of the 
recording operation. 

In the system area, data for one sync block 
as shown in Fig. 16D is recorded. The minimum data 
length is 109 bytes. The rest of the system area 
contains dummy data. The data of 109 bytes is composed 
of system data of 5 bytes, header data (Mpeg) of 2 
bytes, picture information (Picture Info) of 10 bytes, 
and user data of 92 bytes. The system data contains 
edit point information, picture format information such 
as line frequency, frame frequency, and aspect ratio, 
information representing properness of recorded MPEG 
elementary stream against syntax, information 
representing shuffling method, and so forth. 

As header data, at least part of information 
contained in the header of the sequence layer and the 
header of the sequence layer is recorded. Although the 
format of the bit stream that is recorded should 
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satisfy the encoding syntax of MPEG, as a required 
condition, data necessary for decoding the bit stream 
or data for creating the header of the sequence layer 
or the header of the picture layer can be recorded as 
header data of the system area. Thus, it is not 
required that header data recorded in the system area 
satisfy the encoding syntax of MPEG. According to the 
embodiment, since all frames are intra- encoded, one GOP 
is composed of one I picture, and one slice is one 
macro block, information of the header portion is fixed. 
Thus, it is not necessary to record such information. 

As picture information (Picture Info) of the 
system area, encoding information of the encoder of 
another digital VTR is recorded. As user data, serial 
number, model name, record year-month-day, and so forth 
of the digital VTR are recorded. 

According to the embodiment, as was described 
above, to securely obtain a decoded picture in the high 
speed reproducing operation, two countermeasures - one 
for causing the header of the sequence layer to be the 
same as the header of the picture layer in all frames 
and the other for causing the header information to be 
recorded to the system area - are performed. However, 
it is not necessary to perform both the methods 
together. Instead, with only one method, a picture can 
be effectively decoded in the high speed reproducing 
operation. For example, as long as header data of the 



stream can be securely reproduced in the high speed 
reproducing operation because of a particular packing 
method different from that of the embodiment, with only 
a countermeasure for causing the information of the 
headers of all frames to be the same can be performed. 

Returning to the description of Fig. 15, when 
a signal is reproduced from the magnetic tape 112, the 
signal reproduced from a magnetic tape 112 by the 
rotating drum 111 is supplied to a reproduction side 
structure of the equalizer 110 that includes a 
reproducing amplifier. The equalizer 110 performs an 
equalizing process, a waveform trimming process, and so 
forth for the reproduced signal. When necessary, the 
equalizer 110 performs a demodulating process, a 
Viterbi decoding process, and so forth for the 
reproduced signal. An output of the equalizer 110 is 
supplied to an ECC decoder 113. 

The ECC decoder 113 performs the reverse 
process of the ECC encoder 109. The ECC decoder 113 
comprises a main memory, an inner code decoder, an 
audio deshuf fling portion, a video deshuf fling portion, 
and an outer code decoder. The main memory has a large 
storage capacity. The ECC decoder 113 comprises a 
video deshuf fling and depacking portion and a video 
data interpolating portion. Likewise, the ECC decoder 
113 comprises an audio AUX separating portion and an 
audio data interpolating portion. The ECC decoder 113 
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is composed of for example one integrated circuit. 

Next, the process performed by the ECC 
decoder 113 will be described. The ECC decoder 113 
detects synchronization. In other words, the ECC 
decoder 113 detects a synchronous signal added at the 
beginning of a sync block and extracts a sync block. 
Each sync block of the reproduction data is supplied to 
the inner code decoder. The inner code decoder 
corrects an error of a sync block with inner code. For 
an output of the inner code decoder, an ID 
interpolating process is performed. The ID (for 
example, the sync block number) of a sync block from 
which an error is detected with inner code is 
interpolated. The resultant reproduced data is 
separated into video data and audio data. 

As was described above, the video data 
represents both DCT coefficient data generated in the 
MPEG intra- encoding process and system data. Likewise, 
the audio data represents PCM (Pulse Code Modulation) 
data and audio AUX. 

The separated audio data is supplied to the 
audio deshuf fling portion. The audio deshuf fling 
portion performs the reverse process of the shuffling 
process performed by the record side shuffling portion. 
An output of the deshuffling portion is supplied to the 
audio outer code decoder. The outer code decoder 
corrects an error of the audio data with outer code. 



The audio outer code decoder outputs error- corrected 
audio data. When an error of audio data cannot be 
corrected, an error flag is set. 

An output of the audio outer code decoder is 
supplied to the audio AUX separating portion. The 
audio AUX separating portion separates audio AUX from 
the audio data that is output from the audio outer code 
decoder. The separated audio AUX is output from the 
ECC decoder 113 (the route is not shown). The audio 
AUX is supplied to the data interpolating portion. The 
data interpolating portion interpolates a sample 
containing an error. The interpolating method is for 
example average value interpolating method, preceding 
value hold method, or the like. In the average value 
interpolating method, a sample containing an error is 
interpolated with an earlier correct sample and a later 
correct sample. In the preceding value hold method, a 
preceding correct value is held. 

An output of the data interpolating portion 
is an output of audio data that is output from the ECC 
decoder 113. The audio data that is output from the 
ECC decoder 113 is supplied to a delay 117 and an SDTI 
output portion 115. The delay 117 absorbs the delay in 
the process for video data performed in an MPEG decoder 
116. The delay 117 delays the audio data by a 
predetermined time period and supplies the delayed 
audio data to an SDI output portion 118. 



The separated video data is supplied to the 
deshuf fling portion. The deshuf fling portion performs 
the reverse process of the shuffling process performed 
by the record side shuffling portion. The deshuf fling 
portion restores the sync blocks shuffled by the record 
side shuffling portion to the original sync blocks. An 
output of the deshuffling portion is supplied to the 
outer code decoder. The outer code decoder corrects an 
error of each sync block with outer code. When an 
uncorrectable error takes place, an error flag that 
represents that there is an error is set. 

An output of the outer code decoder is 
supplied to the deshuffling and depacking portion. The 
deshuffling and depacking portion restores macro blocks 
shuffled by the record side packing and shuffling 
portion to the original macro blocks. In addition, the 
deshuffling portion and depacking portion depacks 
packed macro blocks. In other words, the deshuffling 
portion and depacking portion restores fixed length 
code of each macro block to the original variable 
length code. In addition, the deshuffling and 
depacking portion separates the system data from the 
video data. The system data is output from the ECC 
decoder 113 and supplied to a system controller 121. 

An output of the deshuffling and depacking 
portion is supplied to the data interpolating portion. 
The data interpolating portion corrects data to which 



an error flag has been set (namely, data having an 
error). In other words, before the conversion is 
performed, if macro block data has an error, DCT 
coefficients of frequency components after the position 
of the error cannot be corrected. In such a case, for 
example, data at the position of the error is 
substituted with block end code (EOB) so that DCT 
coefficients of the subsequent frequency components 
become zero. Likewise, when video data is reproduced 
at high speed, only DCT coefficients corresponding to 
the sync block length are restored. The other 
coefficients are substituted with zero data. In 
addition, the data interpolating portion performs a 
header recovering process for a header at the beginning 
of video data (sequence header, GOP header, picture 
header, user data, and so forth) when the header has an 
error . 

Since DCT coefficients are arranged from DC 
components to higher frequency components over all DCT 
blocks, even if DCT coefficients after a particular 
point are omitted, DC components and lower frequency 
components can be equally placed in individual DCT 
blocks that compose a macro block. 

Video data that is output from the data 
interpolating portion of the ECC decoder 113 is 
supplied as an output of the ECC decoder 113. The 
outputs of the ECC decoder 113 are supplied to a 



reproduction side multi-format converter (hereinafter 
abbreviated to reproduction side MFC) 114. The 
reproduction side MFC 114 performs the reverse process 
of the above -described record side MFC 106. The 
reception side MFC 114 includes a stream converter. 
The reception side MFC 114 is composed of for example 
one integrated circuit. 

The stream converter performs the reverse 
process of the record side stream converter. In other 
words, the stream converter rearranges DCT coefficients 
arranged over a plurality of DCT blocks corresponding 
to frequency components to DCT coefficients in each DCT 
block. Thus, the reproduced signal is converted into 
an MPEG2 elementary stream. 

On the other hand, as with the record side, 
the input and output of the stream converter should 
have a sufficient transmission rate (band width) 
corresponding to the maximum length of macro blocks . 
When the length of each macro block (slice) is not 
limited, it is preferred to secure the band width three 
time larger than the pixel rate. 

An output of the stream converter of the 
reception side MFC 114 is an output of the reception 
side MFC 114. The output of the reception side MFC 114 
is supplied to the SDTI output portion 115 and the MPEG 
decoder 116 . 

The MPEG decoder 116 decodes the elementary 



stream and outputs video data. The elementary stream 
is supplied to the MPEG decoder 116. The MPEG decoder 
116 performs a pattern matching for the elementary- 
stream and detects sequence header code and start code 
therefrom. Corresponding to the detected sequence 
header code and start code, the MPEG decoder 116 
extract encoding parameters contained in the header 
portion of each layer. Corresponding to the extracted 
encoding parameters, the MPEG decoder 116 performs an 
inverse quantizing process and an inverse DCT process 
for the elementary stream. 

The decoded video data that is output from 
the MPEG decoder 116 is supplied to the SDI output 
portion 118. As described above, the audio data that 
has been separated from the video data by the ECC 
decoder 113 is supplied to the SDI output portion 118 
through the delay 117. The SDI output portion 118 maps 
the supplied video data and audio data in the SDI 
format and outputs a stream having the data structure 
of the SDI format. The stream is output from the SDI 
output portion 118 to the outside through an output 
terminal 120. 

On the other hand, the audio data separated 
from the video data by the ECC decoder 113 is supplied 
to the SDTI output portion 115. The SDTI output 
portion 115 maps the video data and audio data to the 
SDTI format so as to converts them to a stream having a 
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data structure of the SDTI format. The converted 
stream is output to the outside through an output 
terminal 119 . 

When an external device to which an SDTI 
stream has been supplied from the output terminal 119 
needs to perform an MPEG decoding process, the external 
device performs a pattern matching for the supplied 
stream, detects sequence start code and start code, and 
extracts encoding parameters of the header portion o£_ 
each layer. Corresponding to the extracted encoding 
parameters, the external device decodes the supplied 
SDTI stream. 

In Fig. 15, the system controller 121 is 
composed of for example a micro computer. The system 
controller 121 controls the entire operations of the 
recording and reproducing apparatus. A servo 122 
communicates with the system controller 121 so as to 
control the traveling of the magnetic tape 112 and the 
driving of the rotating drum 111. 

Fig. 17A shows the order of DCT coefficients 
of video data that is output from the DCT circuit of 
the MPEG encoder 102. This order applies to an MPEG ES 
that is output from the SDTI receiving portion 108. In 
the following description, an output of the MPEG 
encoder 102 will be described as an example. DCT 
coefficients are zigzag- scanned and output starting 
from the upper left DC component of the DCT block in 



the directions of which horizontal and vertical spatial 
frequencies become high. Thus, as shown in Fig. 17B, a 
total of 64 (8 pixels x 8 lines) DCT coefficients are 
arranged in the order of frequency components. 

The DCT coefficients are variable -length 
encoded by the VLC portion of the MPEG encoder. In 
other words, the first coefficient is fixed as a DC 
component. The next components (AC components) are 
assigned code corresponding to sets of runs" of zeros 
and levels. Thus, the variable length code encoded 
output of coefficient data of AC components is a 
sequence of AC i# AC 2 , AC 3 , . . . and so forth in the order 
from lower frequency (low order) components to higher 
frequency (high order) components. The elementary 
stream contains variable length code encoded DCT 
coefficients . 

The record side stream converter that is 
built in the record side MFC 106 rearranges DCT 
coefficients of the supplied signal. In other words, 
the DCT coefficients arranged in the order of frequency 
components in each DCT block by the zigzag scanning are 
rearranged in the order of frequency components over 
all DCT blocks that composes the macro block. 

Fig. 18 shows an outline of the rearrangement 
of DCT coefficients performed by the record side stream 
converter. In the case of a (4:2:2) component signal, 
one macro block is composed of four DCT blocks (Yi, Y 2 , 

48 



Y 3 , and Y 4 ) of the luminance signal Y , two DCT blocks 
(Cbi and Cb 2 ) of the color difference signal Cb, and 
two DCT blocks (Cr x and Cr 2 ) of the color difference 
signal Cr. 

As was described above, the MPEG encoder 102 
zigzag- scans DCT coefficients corresponding to the 
MPEG2 standard. As shown in Fig. 18A, DCT coefficients 
are arranged in the order of a DC component and AC 
components from the lowest frequency component to the 
highest frequency component for each DCT block. After 
one DCT block has been zigzag- scanned, the next DCT 
block is zigzag- scanned. Likewise, DCT coefficients 
are arranged. 

In other words, in a macro block, in each of 
the DCT blocks Yi, Y 2 , Y 3 , and Y 4 and the DCT blocks Cbi, 
Cb 2 , Cr i# and Cr 2 , DCT coefficients are arranged in the 
order of a DC component and AC components from the 
lowest frequency component to the highest frequency 
component. Variable length code encoding is performed 
in such a manner that sets of runs and levels are 
assigned code such as [DC, ACi, AC 2 , AC 3 , . .. ]. 

The record side stream converter interprets 
the DCT coefficients encoded with variable length code, 
detects the delimitation of each coefficient, and 
arranges DCT coefficients over all DCT blocks of the 
macro block corresponding to frequency components. Fig 
18B shows such a process. Firstly, the stream 
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converter arranges DC components over eight DCT blocks 
of the macro blocks. Thereafter, the stream converter 
arranges the lowest frequency AC components over the 
eight DCT blocks. Likewise, the stream converter 
arranges AC coefficient data over the eight DCT blocks 
corresponding to each order component . 

The coefficient data is rearranged in the 
order of DC {Y ± ) , DC (Y 2 ), DC (Y 3 ) , DC (Y 4 ) , DC (Cb x ) , 
DC (Cri), DC (Cb 2 ), DC (Cr 2 ),~ACi (Y x ), AC X (Y 2 ), AC X (Y 3 ) 
ACi (Y 4 ), ACi (Cbi), ACi (Cri) , ACi (Cb 2 ) , AC X (Cr 2 ) , ... 
and so forth. As was described with reference to Fig. 
17, DC, ACi, AC 2 , and so forth are variable length 

code assigned to sets of runs and levels . 

The converted elementary stream of which the 
order of coefficient data has been changed by the 
record side stream converter is supplied to the packing 
and shuffling portion of the ECC encoder 109. The data 
length of each macro block of the converted elementary 
stream is the same as the data length of each macro 
block of non-converted elementary stream. In addition, 
although the MPEG encoder 102 fixes the length of each 
GOP (one frame) by the bit rate control, the length of 
each macro block varies. The packing and deshuf fling 
portion packs data of the macro block to a fixed length 

Fig. 19 shows an outlines of a packing 
process for macro blocks performed by the packing and 
shuffling portion. Macro blocks are packed to a 



predetermined fixed length. The fixed length 
corresponds to the data length of a sync block that is 
the minimum unit of data that is recorded and 
reproduced. Thus, the shuffling and error correction 
code encoding process can be easily performed. For 
simplicity, in Fig. 19, it is assumed that one frame 
contains eight macro blocks . 

As shown in Fig. 19A, since the variable 
length encoding process -is performed for eight macro 
blocks, their lengths are different each other. In 
this example, the length of each of macro blocks #1, #3, 
and #6 is larger than the fixed length as the length of 
the data area of one sync block. The length of each of 
macro blocks #2, #5, #7, and #8 is smaller than the 
fixed length. The length of macro block #4 is almost 
equal to the length of one sync block. 

The packing process packs a macro block to 
the fixed length of the length of one sync block. This 
is because the amount of data generated in one frame 
period is fixed. As shown in Fig. 19B, when the length 
of a macro block is longer than the length of one sync 
block, the macro block is divided at the position of 
the length of one sync block. The portion that exceeds 
the length of one sync block (namely, overflow portion) 
is successively packed to macro blocks each of which 
does not exceed the length of one sync block in such a 
manner that the overflow portion is preceded by those 



macro blocks . 

In the example shown in Fig. 19B, in the 
macro block #1, the portion that exceeds the length of 
one sync block is packed after the macro block #2. 
When the length of the resultant macro block #2 becomes 
the same as the length of one sync block, the remaining 
portion is packed after the macro block #5. Next, in 
the macro block #3, the portion that exceeds the length 
of one sync block is packed after the macro block #7. 
Thereafter, in the macro block #6, the portion that 
exceeds the length of one sync block is packed after 
the macro block #7. The remaining portion is packed 
after the macro block #8. In such a manner, each macro 
block is packed to the fixed length as the length of 
one sync block. 

The record side stream converter can 
predetermine the length of variable length data of each 
macro block in advance. Thus, the packing portion can 
know the end of data of each macro block without need 
to decode VLC data and check the content . 

Fig. 20 shows the packing process for data 
for one frame in more detail. In the shuffling process, 
macro blocks MB1, MB2, MB3, MB4 , ... dispersed on the 
screen as shown in Fig- 20A are successively arranged 
as shown in Fig. 25B. The header of the sequence layer 
and the header of the picture layer are added to each 
frame. The header portion composed of such headers is 



treated as a top macro block and packed. As shown in 
Fig. 20C, the overflow portion that extrudes from the 
fixed length (sync block length) is packed to areas 
that have a space. In Fig. 25B, the overflow portions 
are denoted by 300, 301, and 302. The packed data in 
such a manner is recorded on the magnetic tape 112 as 
shown in Fig. 20D. 

In the high speed reproducing operation of 
which a magnetic tape is traveled at a higher speed 
than the recording operation, the rotating head traces 
a plurality of tracks at a time. Thus, the reproduced 
data contains data of different frames. When the 
reproducing operation is performed, the depacking 
process is performed in the reverse manner as the 
packing process. When the depacking process is 
performed, all data for one frame should have been 
arranged. When data of a plurality of frames is mixed 
as in the high speed reproducing mode, the depacking 
process cannot be performed. Thus, in the high speed 
reproducing operation, only data that does not protrude 
from the fixed length is used rather than overflow data. 

Thus, since the data length of the header 
portion is larger than the sync block length, as shown 
in Fig. 20B, data 300 that protrudes from the fixed 
length cannot be used in the high speed reproducing 
operation. Thus, the data of the header portion cannot 
be fully reproduced. However, according to the 



embodiment, since the system area contains information 
necessary for the decoding process and the rotating 
head almost securely traces the system area in the high 
speed reproducing operation, pictures can be decoded in 
the high speed reproducing mode. 

Fig. 21 shows a more practical structure of 
the above -described ECC encoder 109. In Fig. 21, 
reference numeral 164 is an interface for an external 
main memory 160 connected to the IC. The main memory 
160 is composed of an SD RAM. The interface 164 
arbitrates a request from the ECC encoder 109 to the 
main memory 160 and performs a read/write process from 
and to the main memory 160. In addition, a packing 
portion 137a, a video shuffling portion 137b, and a 
packing portion 137c compose a packing and shuffling 
portion. 

Fig. 22 shows an example of the address 
structure of the main memory 160. The main memory 160 
is composed of for example a 64 -Mbit SDRAM. The main 
memory 160 comprises a video area 250, an overflow area 
251, and an audio area 252. The video area 250 is 
composed of four banks (vbank #0, vbank #1, vbank #2, 
and vbank #3). Each of the four banks can store a 
digital video signal for one fixed length unit. One 
fixed length unit is a unit of which the amount of 
generated data is controlled to an almost target value. 
One equal length unit is for example one picture (I 
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picture) of a video signal- In Fig. 22, portion A 
represents a data portion of one sync block of a video 
signal. One sync block contains data of bytes that 
depend on the format that is used. To deal with a 
plurality of formats, the data size of one sync block 
is larger than the maximum number of bytes of sync 
blocks of individual formats. For example, the number 
of bytes of one sync block is 256 bytes. 

Each bank of the video area is divided into a 
packing area 250A and an inner code encoder output area 
250B. The overflow area 251 is composed of four banks 
corresponding to the above -described video area. The 
main memory 160 has an audio data processing area 252. 

According to the embodiment, with reference 
to a data length sign of each macro block, the packing 
portion 137a stores the fixed length data and overflow 
data that exceeds the fixed length to different areas 
of the main memory 160. The fixed length data is data 
that does not exceed the length of the data area of a 
sync block. Hereinafter, the fixed length data is 
referred to as block length data. The block length 
data is stored in the packing area 250A of each bank. 
When the length of a macro block is smaller than the 
block length, the corresponding area of the main memory 
160 has a blank region. The video shuffling portion 
137b controls the write addresses so as to shuffle 
macro blocks. The video shuffling portion 137b 



shuffles only block length data rather than overflow 
portions. The overflow portions are written to an area 
assigned to the overflow data. 

Next, the packing portion 137c packs overflow 
portions to a memory of an outer code encoder 139. In 
other words, the packing portion 137c reads data having 
the block length from the main memory 160 to a memory 
for one ECC block of the outer code encoder 139. When 
the block length data has a blank region, the packing 
portion 137c packs the overflow portion to the block 
length data having the blank region. After the packing 
portion 137c has read data for one ECC block, it 
temporarily stops reading data. The outer code encoder 
139 generates an outer code parity. The outer code 
parity is stored to the memory of the outer code 
encoder 139. After the outer code encoder 139 has 
completed the process for one ECC block, data and outer 
code parities that are output from the outer code 
encoder 139 are rearranged in the order of the inner 
code encoding. The resultant data is written again to 
an output area 2 SOB that is different from the packing 
process area 250A of the main memory 160. A video 
shuffling portion 140 controls the addresses of the 
main memory 160 at which data that has been encoded 
with outer code is written so as to shuffle sync blocks. 

In such a manner, block length data and 
overflow data are separated. The block length data is 



written to the first area 250A (as first packing 
process ) . The overflow data is packed to the memory of 
the outer code encoder 139 (as second packing process). 
The outer code parity is generated. The data and outer 
code parity are written to the second area 250B of the 
main memory 160. Those processes are performed for 
each ECC block. Since the outer code encoder 139 has a 
memory having the size of one ECC block, the access 
frequency against the main memory 160 can be decreased. 

After a predetermined number of ECC blocks 
contained in one picture (for example, 32 ECC blocks) 
have been processed, the packing process and the outer 
code encoding process for one picture are completed. 
Data that is read from the area 2 SOB of the main memory 
160 is processed by an ID adding portion 148, an inner 
code encoder 147, and a synchronization adding portion 
150. A parallel- serial converting portion 124 converts 
output data of the synchronization adding portion 150 
into bit serial data. The output serial data is 
processed by a partial response class 4 precoder 125. 
When necessary, the output is digitally modulated. The 
resultant data is supplied to a rotating head disposed 
on the rotating drum 111. 

A sync block that does not have valid data 
(this sync block is referred to as null sync) is placed 
in an ECC block so that ECC blocks can become flexible 
against the difference of formats of record video 



signals . A null sync is generated by the packing 
portion 137a of the packing and shuffling portion 137. 
The generated null sync is written to the main memory 
160. Thus, since the null sync has a data record area, 
it can be used as a record sync for an overflow portion - 

In the case of audio data, even number 
samples and odd number samples of audio data of one 
field form different ECC blocks. Since an ECC outer 
code sequence is composed of audio samples in the input 
order, whenever an audio sample of an outer code 
sequence is input, an outer code encoder 136 generates 
an outer code parity. A shuffling portion 147 controls 
the addresses of the audio data processing area 252 of 
the main memory 160 against an output of the outer code 
encoder 136 so as to shuffle it (in each channel and in 
each sync block) . 

In addition, a CPU interface 126 is disposed. 
The CPU interface 126 receives data from an external 
CPU 127 that functions as a system controller and 
designates parameters for the internal blocks. To 
handle a plurality of formats, the CPU interface 126 
can designate many parameters such as sync block length, 
parity length. 

"Packing length data" as a parameter is sent 
to the packing portions 137a and 137b. The packing 
portion 137a and 137b each pack VLC data in the fixed 
length (that is a length represented as "payload 
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length" shown in Fig. 19A) designated corresponding to 
the parameter "packing length data" . 

"Number of packs data" as a parameter is sent 
to the packing portion 137b, The packing portion 137b 
designates the number of packs per sync block 
corresponding to the parameter "number of packs data". 
Data for the designated number of packs is supplied to 
the outer code encoder 139. 

"Number of video outer code parities data" as 
a parameter is sent to the outer code encoder 139. The 
outer code encoder 139 encodes video data having 
parities corresponding to the parameter "number of 
video outer code parities data" with outer code. 

"ID information" and "DID information" as 
parameters are sent to an ID adding portion 148. The 
ID adding portion 148 adds the ID information and the 
DID information to a data sequence having a unit length 
that is read from the main memory 160. 

"Number of video inner code parities data" 
and "number of audio inner code parities data" as 
parameters are sent to the inner code encoder 149. The 
inner code encoder 149 encodes video data and audio 
data having parities corresponding to the parameters 
"number of video inner code parities data" and "number 
of audio inner code parities data" with inner code. In 
addition, "sync length data" as a parameter is sent to 
the inner code encoder 149. Thus, the unit length 



(sync length) of data that has been encoded with inner 
code is limited. 

In addition, shuffling table data as a 
parameter is stored to a video shuffling table (RAM) 
128v and an audio shuffling table (RAM) 128a. The 
shuffling table 128v performs an address conversion for 
the video shuffling portions 137b and 140. The 
shuffling table 128a performs an address conversion for 
the audio shuffling 137. 

As described above, according to the 
embodiment of the present invention, a record data area 
(video sector and audio sector) and a system area (sys) 
are formed as separate areas. At least part of the 
header portion (the header of the sequence layer and 
the header of the picture layer) is recorded in the 
system area. Fig. 23 shows the structure of the 
recording process for the system area. The structure 
is disposed in the record side MFC 106. 

In Fig. 23, reference numeral 51 represents 
one input stream selected from MPEG streams supplied 
from the MPEG encoder 102 and the SDTI receiving 
portion 108. The input stream 51 may be a stream of 
which coefficient data has been rearranged. The input 
stream 51 is directly output as an output stream 52 for 
a recording process. In addition, the input stream 51 
is supplied to a detecting portion 53. 

The detecting portion 53 detects the header 



of the sequence layer and the header of the picture 
layer from the input stream 51, detects all information 
of the headers or part of information necessary for the 
decoding process, and separates the detected 
information from the input stream 51. For example, the 
detecting portion 53 detects as information contained 
in the header of the sequence layer the number of 
pixels, bit rate, profile, level, color difference 
format, progressive sequence, and so forth and as 
information contained in the header of the picture 
layer, information (flags) representing the setting of 
the accuracy of DC (Direct Current) coefficient of an 
intra macro block, the designations of the frame 
structure, field structure, and display field, the 
selection of the quantizing scale, the selection of the 
VLC type, the selection of the zigzag/alternate 
scanning, and the designations of the chroma format and 
so forth. 

Information 54 separated by the detecting 
portion 53 is recorded to the system area. In this 
case, as was described with reference to Fig. 16, a 
signal process is performed so that data of a sync 
block is structured along with other information 
recorded in the system area and that the sync block is 
recorded in the system area at a predetermined position 
of a video sector. 

Fig. 24 shows a signal processing portion 



disposed in the reproduction side MFC 114. An input 
stream denoted by 61 is a stream reproduced from a 
magnetic tape* An input denoted by 62 is data 
reproduced from the system area. These data is 
supplied to a selector 63. The selector 63 has both a 
function for creating the header of the sequence layer 
and the header of the picture layer corresponding to 
reproduced data 62 of the system area and a function 
for output ting one of the input stream 61 and a stream 
with a header corresponding to the header of the input 
stream 61 as an output stream 65. The selecting 
operation of the selector 63 is controlled 
corresponding to a mode 64. The mode 64 is data that 
represents the operation mode of the digital VTR. The 
mode 64 is output from the system controller 121 
corresponding to the key operation of the user and so 
forth. 

Fig. 25 is a flow chart showing a signal 
process on the reproduction side. First of all, at 
step SI, it is determined whether or not the 
reproducing operation is high speed reproduction mode 
corresponding to the mode 64. When the reproducing 
operation is not the high speed reproducing mode, the 
header (the header of the sequence layer and the header 
of the picture layer) contained in the input stream 61 
is used as the header of the output stream 65 (at step 
62) . 



When the determined result at step SI 
represents that the reproducing operation is the high 
speed reproducing mode corresponding to the mode 64, 
the header (the header of the sequence layer and the 
header of the picture layer) is created corresponding 
to data reproduced from the system area (at step S3). 
The selector 63 outputs an output stream 65 of which 
the created header has been added to the input stream 
61. As a result of such a process, the output stream 
65 that is output from the selector 63 securely 
contains the header in the high speed reproducing mode. 

In the high speed reproducing mode, the input 
stream 61 may contain a header. In such a case, the 
header may be output as a valid header. However, when 
the packing process is performed according to the 
embodiment, since the depacking process is not 
performed in the high speed reproducing mode, the 
header created corresponding to the reproduced data of 
the system area is used. 

In the above example, both the header of the 
sequence layer and the header of the picture layer are 
treated as fixed values and recorded to the system area. 
However, since it is rarely that the header of the 
sequence layer is varied for each picture, the present 
invention may be applied to the case that only the 
header of the picture layer is considered. 

In the above example, the present invention 



is applied to a digital VTR that records MPEG and JPEG 
data streams. In addition, the present invention can 
be applied to compression encoding having another 
hierarchical structure . 

As was described above, according to the 
present invention, the information of headers of all 
frames is the same. Thus, even in the high speed 
reproducing operation of which a stream of one frame is 
composed of fragmental data of different frames, the 
reproduced stream can be securely decoded. 



